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SUMMARY 


The Statistical Reporting Service assumed the responsibility for 

construction of a remotely sensed area sampling frame for 

selected areas In Southern Brazil* Original specifications for 

frame development contained a highly manual process* A 

transparent grid overlay was to be used to determine sample unit f 

location and stratification charadterlstics* The Sampling Frame 

Development Section was concerned with the following aspects of 

the original frame development specifications: (1) burden of 

manual imput, (2) restrictions on the sample unit size, (3) 

nonsampling errors, and (4) strata boundaries and definitions. 

The paper describes an automated approach to frame construction 
which was developed by the section. Noteworthy differences, 
from the original procedures Include: (1) a data base approach, 

(2) reduced manual input, (3) ability to change sample unit 
size, and (4) improved stra.lf icatlon ability. 


ill 
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Construction of a Remotely Sensed 
Area Sampling Frame for Southern Br^il 

Ron Fecso Van Johnson 

Wayne Gardner Steve Pavtasek 
Bob Hale 


INTRODUCTION 


ORIGINAL FRAME 
DEVELOPMENT PROCE- 
DURES 


The Statistical Reporting Service (SRS) has assximed the re- 
sponsibility for constructing the sampling frames for the 
Agriculture and Resources Inventory Surveys through Aerospace 
Remote Sensing (AgRISTARS) program* 

The AgRISTARS program is a joint undertaking of the United 
States Departments of Agriculture (USDA), Commerce (USDC), and 
Interior (USDI), tbe National Aeronautics and Space Adminis- 
tration (NASA) and the Agency for International Development 
(AID). Development, planning, and execution of the frame 
construction was done by the Sampling Frame Development Section 
(SFDS), Sampling Frames and Survey Research Branch (SRB), 
Statistical Research Division (SRD) at the Fairfax location. . 

The task was to construct a remotely sensed area sampling frame 
for selected areas in southern Brazil (The states of Parana, Rio 
Grande do Sul, and Santa Cacarlna.). The frame and an 
associated computerized system were constructed in a manner 
which will allow digitally stored sample unit Information and 
computerized sample selection. Sampling procedures and software 
were developed for latitudinal/longitudinal characterized 
populations in a way which permits variable sampling 
specifications. 


The definition of the sample units prior to this work had been 
a manual process. A transparent grid overlay was made to the 
desired cluster size and placed over the associ.ated base map 
(usually an Operational Navigation Chart). The sample units 
were then defined as the area enclosed by the individual grids* 
At this point, each grid was labeled for identification and 
assigned some euxlliary information for use in the creation of a 
stratified sampling design. The auxiliary Infotmatlon was based 
on visual interpretation of LANDSAT 1:1,000,000 transparencies 
which were '’registered" to a base map by aligning observable 
physical features conmK>n to both products. The information was 
percentage of agriculture estimated to the nearest ten percent. 

There are a number of problems associated with sample unit de- 
termination. These problems arc now listed for reference when 
describing the benefits of the automated system. 
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1* Burden of Knnunl Input 

Considorablo aanual input Is oiieosssry In the current pro- 
cedures* Overlays need to be dratm according to the 
variable base nap scales* Manual, interpretation of the 
auxiliary Infomatlon Is necessary for a very large number 
of sample units* The accuracy of this control data may 
become suspect If the burden ot time requirements forces 
quick estimation for many of the sample units* The 
relatively small cluster size may be difficult for Image 
Interpretation at a 1:1,000,000 scale* And not trivially, 
eac). of the many sample units will have to have a record 
ownually created with identification and control 
Infomatlon present* 

2* Restriction on Sample Unit Size 

The aumual procedure produces clusters of a given size* If 
future research shows that considerable efficiency can be 
gained from the uae of a different size, almost all the 
manual work will have to be redone to change to the new 
cluster size* 

3* Nonsampling Errors 

Potential causes of nonsaapllng error Include overlay dis- 
tortion and record keeping errors* Also changes in scale 
between base siaps might make complete coverage of the 
population difficult. 

4* Inefficient Strata Boundaries 

In the manual procedure, the strata boundaries are defined 
by the land uae of the clusters and the fixed edges of the 
clusters* If the cluster size is changed the boundaries 
may not be efficient or even useable* 


SAMPLING CONCEPTS The problem of AgRiSTARS' frame development and sample selection 

is considerably different from the area sampling frame methodol- 
ogy used by SRS in its U.S* agricultural estimation program* 

The programs differ mostly in the amount of labor intensive- 
ness* The SRS program requires manual collection of ground 
truth information which neeessitaens clearly recognizable 
boundaries for all sample units* As t result, the frame 
development and sample selection requite a great deal of manual 
input. Since the AgRISTARS pr^afram will not require physically 
observable aample unit boundaries, such as roads, rivers, and so 
forth, Che frame construction and sampling procedure has the 
potential for considerable automation* With large areas needing 
stratification, automation becomes an operational necessity* 

A stratified design was determined to ba best suited to meet the 
objectives of the AgRISTARS programs* This design allows for an 
afficlent distribution of a limited sample size to areas which 
are homogeneous in crop density and, as much as possible, in 


2 


ORIGINAL PAGE IS 
OF POOR QUALITY 

yield potential of the various crops being estimated. Since 
there is a possibllty that sample unit sizes or stratification 
procedures may change with continuing research, the development 
of an automated system which would be used to classify sample 
units Into strata Is advantageous. 

The first task to provide land use stratification was to deter- 
mine which characteristics associated with the desired esti- 
mates can be quantified based on the available materials. We 
had to determine the characteristics which could be Identified 
through inspection of LAMDSAT imagery, in conjunction with topo- 
graphic and soils maps. These characteristics should aid in the 
Improvement of the sampling deslgr , allocation, development of 
LAMDSAT Interpretation, signature extension, and yield model 
development . 

The availability of lANDSAT Imagery for the target region in 
Brazil was deceiuined. Paper and film products with various 
scales were ordered for selected sites and acquisition dates 
were staggered through the growing season. In addition to the 
imagery, the following materials were acquired: 

1. Operational Navigation Charts (ONC) at a 1:1,000,000 scale 
from the Aeronautical Charts and Information Command (ACIC) 
for use as a base map in registration of the frame and 
LANDSAT scenes. 

2. Soils maps for use in construction of the land use strata 
(LUS). 

3. Maps of the smallest political subdivisions (SPD). 

The following materials were also used when available: 

1. Topographic maps from the Defense Mapping Agency (mA}. 

2. Land use maps. 

3. Geological maps. 

4. Meteorological (temperature and precipitation) data from 
the National Oceanic and Atmospheric Administration (NOAA). 

5. Crop statistical data. 

6. ONC base physical features overlays from ACIC. 

Base map overlays when not available from ACIC, were made by the 
stratification group of SFDS. These overlays have major 
physical features such as rivers, lakes, and major road systems. 
The base map and its overlay contain enough information to 
register the LAMDSAT imagery for the area being stratified. 

The next step was to determine the useful auxiliary data which 
can be estimated from inspection of the materials. "Useful 
auxiliary data” in the statistical sense is information which 
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etn b« usttd to rodueo eh« orror of tho oseiaMros through **»- 
pllag •fflcloneloi. This error reduction enn oe eccompllehed 
through the use of e stratified design, probability proportional 
to size sampling, or through more efficient estimators such ss 
ratio or regression types. For this application, the search 
was restricted to information useful for stratification with re- 
spect to estimates and forecasts of corn and soybean production. 

The Brazil area frame was to provide the basis to estimate aiul 
forecast the p-oduction of corn and soybeans. The estimator of 
production for either commodity is the product of an acreage 
estimate and a yield per acre estimate. Thus, the information 
necessary for an effective stratification of the area frame was 
the density of the crop in the sasq>le unit and the yield poten- 
tial for the crop in the area of the sample unit. The Importance 
of items used for auxiliary information depends on the items' 
influence on the variability of the production forecast. 
Experience in the United States shows that the yield forecast 
contributes heavily to the variability of the production fore- 
cast. Research, conducted as part of the Large Acreage Crop 
Invectory Experiment (LACIE) , suggested that this relationship 
holds globally. 

The following questions were considered in trying to achieve an 
effective stratification: 

1. What would the sample unit be? 

2. Which characteristics could we use to stratify? 

3. How could we stratify for more than one forecast item? 

4. How could we use the auxiliary information to create strata? 

The grouping of sample units into strata could only be accom- 
plished with some knowledge of the sample unit content. There- 
fore, before pursuing stratification whe sampling unit had to be 
defined. Then the stratification was merely a clustering of 
sample units in an attempt to reduce the heterogeneity within 
earh stratum with respect to the overall population. 

The reporting unit was predefined as the remotely sensed 
information corresponding to a LANUSAT "pixel." Thus, the 
elmsents of the population of interest consisted of the pixels 
for the areas within the Brazilian states of Parana, Santa 
Catarina, and Rio Grande Do Sul. 

Since an area the size of a LANDSAT pixel (57 by 79 meters) can 
be considered flat, J.ements of the population of interest 
was defined to be the land areas associated with each pixel. 
Thus, disregarding the roll, pitch, axul yaw of the satellite, 
there is approximately a one-to-one correspondence between 
57 by 79 meter disjoint pieces of the earth and LANDSAT pixels 
(see Figure 1). 




SOURCE: LANDSAT USERS GUIDE 


The sanpling frame for the area of interest was, thus, defined 
as S7 by 79 meter rectangular areas (elonents) which do not 
overlap and which completely cover the area of interest. Note 
that the boundary of the frame will become altered to match the 
rectangular shape of the pixel or as later seen the rectangular 
shape of the sample units. The reporting unit fir each 
element is the LANDSAT pixel with a centerpoint c'.oseat to the 
centerpoint of the element. 

The actual frame from which samples will be drawn consists of 
the map corrected images now produced by NASA. Basically, the 
orittnal pixel data is “resampled'* by transforming the pixel 
‘ locations to a 57 by 37 meter grid system. The technique used 
can be found in the papers by Uilllamson (24) and Donbvan (3^). 

The frame used for AgRISTARS sampling consists of the 57 meter 
square areas (elements) of land which correspond to the resam- 
pled frame pixels described above. Thus, all land is conceptu- 
ally included In an element and no land is in two elements. The 
resampled MSS pixel information for the pixel closest to the 
land element is the reporting unit. 


17 Underscored numbers in parentheses refer to literature 
cited at the end of this report. 
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OPKMlQMMLm 

Currtnt toftnara Haiti thi flixtbllity of proeiiiiAg thi 
LAMDSAT iici. Thirifori, thi iiapllng diilgn for Britil w»i 
roitrlctid to on* in which the iimpllng unici contiln cluitiri 
of ilimonti which in nctingulir arrays of 193 by 117 ilomsnts, 
i.t. pixels. This llmlcacion Is expected to be eased to allovr 
a rectangular array of any site for future countries. Thus, it 
frame permitting flexibility of sample unit size would allow the 
Implementation of research findings on optimal cluster size. 

This research Is being conducted by USDA/SRS and NASA. 

• ' 

The AgRISTARS sumpllng unit was defined as a general rectan- 
gular array L meters long by H meters high where the L and H are 
determined by 

L - (Nj^KP^), and 

B . 

where 

■ pixel length (in meters), 

P^ * pixul height (in meters), 

N. ■ number of pixels along the length of the sample unit 
^ (S.U.), and 

■ number of pixels along the height of the sample unit. 

As array of elements of the earth's surface which had a corre- 
spondii^ array of reporting units (pixels) was defined as the 
sampling unit, these sample units must be defined in the area 
of eetimetion In such a way as to include all elements In a 
sample unit while not Including any In more than one sasiple 
unit. The methodology which will tmlquely define the sample 
units la desrrlbed In the section on sampling frame development. 

COMSTROCnON OF The characteristics used to stratify the sampling units were 

THE LAMfi USE DATA kept Independent of the sample units in a land use data base. 

EASE To develop the data haae, homogeneous lend use blocks were 

recorded in digital form. These blocks were defined by a 
vector of information which contained variables for percentage 
of land cultivated , percentage of cultivated land which is 
ubillaed for com and soybeans, field size, and soil type. The 
data for the vectors of Infometion were developed by using 
LAMDSAT color infrared Imagery (CIR) with crop statiatics for 
the aaallest political subdivisions (SPD) used to help Interpret 
the LAMDSAT slgnaturaa. 

Information frr major crop Items was avelleble at the SPD (Nunl- 
ciplo) level, but total land area was not. V The estimates for 

y Individual commodity satiaMtes were available for com, soy- 
beans, rice. Kidney (dry) beans, cotton, and sugar. The ramein- 
ing emps such as coffee, and wheat were included as other crops. 
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flMjor crops v«ro m 4« by tho Foroiga Agrlcultttrsl S^nrieo (FAS) 
and wera obtained froa cho data baao at tha Laboratory for 
Appiicationa of Raaota Sensing (LARS)* Total land area was 
eatlmatei by dlgltltlng the Munlclplo boundaries shown on aaps 
supplied by NASA, y The perceutage of cultivated land was 
computed by dividing the sum of all crop estimates by the total 
land arat. The percentage cultivated land devoted to com and 
aoybaans was pemputed by dividing areas planted to those crops 
by the sua of all crop area estimates. 

Crop calsndars» Operational Itavigation Charts (<HIC)» soil data» 
and climatological data were also obtained to help interpret the 
LANDSAT signatures* Information on the geography and cultural 
practices of southern Brazil was used to provide *. starting 
point for constructing the land use blocks* 

The land area of the three statea was broken down into 77 **work 
units" or "one degree squares" to provide for an even workload 
flow, tjj These 1* squarec define an area of land within a 
"quadrangle" formed by the intersection of pairs of li&ss of 
latitude and longitude. Data base information for political 
subdivisions and land use was developed within each 1* work 
unit. The use of 1* work units increased tha accuracy of 
registration and provided a data base which could be managed at 
reasonable cost with the automated system. 

Sasv naps and soils naps for each 1® sqv«ir« were drafted using 
ONC's and information prepared by Dr* Westln of South Dakota and 
provided by NASA* A cartographer (carto) placed the base maps 
on the LANDSAT scenes using rivers or other visible landmarks 
for orientation* Then, using the ancillary data and crop cal* 
endars to support the interpretation of the LANDSAT signatures» 
areas buving siailar characteristics such as field patterns, 
bare qoll, and so forth were delineated* The carto then coded 
the areas according to density of cultivation* 

After the initial breakdown and coiing were complete, an analyst 
reviewed the first carto* s work, made necessary changes and ad- 
ditions to the percentages cultivated, and coded the percentages 
of com and soybeans* This work unit then went back to a carto 
who made the corrections to land use boundaries, and modified 
boundaries due to varying soil types* Field size also was *sd 
in this step* 


Constructing Lan d 
Use Blocks 


y Several problems were encountered; 1) the maps showing 
Minlcipio boundaries had to be drafted; 2) the mapa arrived late 
in the construction process , 3) boundaries had to be estimated 
where naps overlapped, 4) a few Municiplos were miscoded, and S) 
eight Municiplos were Miltted entirely* Se*' Appeiullx 4 for 
additional information on data summarisation and the associated 
problems • 

y For a more complete description of the procedures for 
constructing the frame sea the outline of the :.tratlfleatlon 
procedures, the worksheets, and coding sheets included as 
appendices 1-3* 
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Following th« i«con<^ carto stop Cha work unit was again raviavad 
i>y an analyat* Thla ravlaw not only chackad for coaiplatanaaa 
aid conjiatancy within thti work unit, but also chackad for con- 
slstancy batwaan work units* (Tha analyst rev" «ws will be 
discussed further in tha section on quality control)* 

Once the review was complete, the final frame was drafted and 
all land use blocks coded* The frame was checked for accuracy 
and given a final review by a statistician* Once t'..e final 
frame was approved it was ready for digitization* 

Creating The Data At thla point ill the necessary Information was available to 
gaaa create the frame* However, this Information needed to be 

converted Into eomputer readable media* To accomplish this 
transformation graphic coc .’dinate digitizers were used* 

The digitizing equipment consists of: 

1* A tablet with a wire grid underneath to determine the 
location of each point on the tablet* 

2* A 16 key cursor to enter Information* 

3* A microprocessing controller* 

4. A dual floppy disk system to record and store the digitized 
information* 

Before each nap or overlay was digitized, dots were placed on 
the boundaries of every polygon idtenevec there was a noticeable 
change of direction. Therefore, curved lines were reptssmnted 
by a group of straight line segments . Each Kinlcipio or land 
use area was represented by an If sided polygon where N was the 
number *f dots needed to outline the arms. 

• 

Each point on the boundary of the polygon was represented by its 
x>y coordinate system and ms identified by polygon control and 
work unit Identification numbers* Additionally, information for 
machine edits was recorded for each point* The polygon control 
mnber Includf^s coding which defines the vector of InfomaMon* 
Hence, the vector of information for each land use block was 
aosaoelated with every, point determining chat polygon. 

The final frame was digitized by work unit and four files were 
created. These files, (a point file, area file, calibration 
file and calzbratlon area file) were used to check the 
digitization and verify the correctness of the point rile* V 
Points In the point file could be associated .with more than otim 
land use polygon* Hence, these matching points would be 
recorded aore than once* Matching points were averaged and the 
average value assigned to each polygon containing a matched 
point* The point averaging was done to ensure that ro small 


V Doeuaientation for the software used to edit and convert 
the points is Included as appendix 5* 
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areas were omitted or duplicated since very rarely will any 
point be recorded twice with exactly the same x and y coordinate. 

The final step tn creating the data base was to convert the x 
and y coordinates of each point Into latitude and longitude. 

The four points in the calibration file were used to determine a 
linear relationship between the x and y coordinates and latitude 
and longitude. This relationship was then used to convert all 
the points in the point file to the corresponding latitude and 
longitude. The file was stored on magnetic tape for later use 
in classifying the sample units. 

Quality control measures took on several forms throughout frame 
construction. Analysts and rsviewers visually checked the con- 
struction of land use blocks to ensure consistency in inter- 
preting the LANDSAT signatures. A data base cf crop estimates 
for the political subdivisions (municipios) was used to help 
Interpret LANDSAT signatures and to obtain characteristic values 
for land areas where scenes were missing or the imagery was 
obscured by cloud cover. Machine edits, check digits, and the 
point averaging program were used to ensure that the information 
in the digitized data files were complete and correct. Finally, 
the frame was evaluated using comparisons with ground truth 
data. This information was obtained from field observations 
made during travel through the three states In Brazil. 

The analyst reviews were not only steps in constructing the 
frame, but also acted as quality control measures. This process 
uncovered two basic problems with the LANDSAT imagery. First, 
the quality of the paper products received from NASA was very 
poor. Colors and tints varied considerably from scene to 
scene. Often crop signatures were undlscernlble, even when 
there was reasonable multitemporal coverage. Multitemporal 
coverage in itself was u problem because many areAs had only 
scenes stretched over several years. Second, registration of 
the 1* work unit to the LANDSAT scenes was approximate. Both 
the work unit jverlay and the LANDSAT scenes were on a scale of 
1:250,000, but when the overlay was placed on the LANDSAT scene, 
Che rivers, roads and so forth were not aligned except in small 
areas. Consequently, as the stratification work was being done, 
the overlay had to be moved around the LANDSAT scene to achieve 
correct registration for the small area being stratified. Dis- 
cussions between the cartos and analysts in resolving these 
problems and other difficulties led to a more consistent inter- 
pretation of the signatures. 

The second analyst review was used as a check for consistency 
among work units. In this step the land use blocks for adjacent 
%rork units were compared. Blocks which had identical codes for 
percentage of land u’.der cultivation were expected to have sim- 
ilar signatures on tho LANDSAT scenes. If the signatures were 
not consistent, codes were adjusted and/or boundaries redrawn to 
.Improve the homogeneity of the land use blocks. 

To assess and Improve staff capabilities of interpreting LANDSAT 
imagery outside expertise was sought. Bob ?ayne, with Lockheed 
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In Heutton, vlaletd th« Fairfax offiea and provtdad a erlcqua 
of conplatad work and additional training In Incarpratatlon. 

Daia King, no%i tha SRS, USDA, but forwarly an analyat with 
Loekhatd, also ravlawad tha work. 

Machlna adlta vara utad for quality control in tho 
procaaa* Ona progran addad tha araaa of tha land uaa poLygona 
and comparad thla valua to tha total araa of tha work unit* If 
tha two araaa wara not within a tolaranea of ^>2 pareant, tha 
digititad flla waa chackad for accuracy and corrnctad* A chack 
digit, conalatlng of tha unlta valua of tha aun of tha atrattfi- 
cation eodaa. waa includad to chack that codaa wara eorractly 
antarad Into tha data baaa* Finally, tha point avaraging 
progran anaurad that no land araa waa onittad fron tha frana or 
waa dupllcatad. 

Although ona of tha eonatralnta of thla frana conatruetlon 
projaet waa that a vlalt to Braall prior to tha work waa not 
poaalbla, a trip waa nada to avaluata tha work aftar it waa 
coaplatad. Wayna Oardnar and Van Johnaon aada tha avaluatlon 
trip to Braall. (Fabruary 2-20, 19B1). Thay wara aecoi^aalad 
by Dava Rieka, a gaographar with tha Envlronaantal Raaaarch 
Inatltuta of Michigan (BRIN). Gonprahanalva raaulea of that 
trip ara avallabla Li an Infomal trip raport by Gardnar and 
Johnaon (91 6/. Baaleally. 49 loeatlona wara vlaltad within 
tha thraa Braalllan Stataa aad obaarvatlona racordad. Thaaa 
obaarvatlona wo’^'a eoaparad to tha frana valuaa for parcantaga 
of Land cultivated, parenataga of com aad aoybaana and flald 
alM. Thaaa conparlaona ara shown in flguraa 2 t4. 

If tha frana valuaa and ebaarvad valuua aatehad axacrly. only 
tha blocks along tha diagonal (botton loft to top right) of tha 
charts would contain poaitlva valuaa, Slnea parfact corraapon- 
danca of valuaa la not axpactad', a high parcantaga of tha 
obaarvatlona falling in tha blocka on tha diagonal or ona block 
althar aids of it would provide an indication that tha frana 
stratlficatirn waa "good." Thla aaaiaaa that an observation on 
tha low and of a given range could fall in that range or in the 
praeadlng range or vlca-varaa. **Good** aaana that stratifica- 
tion should result In an overall Incraaaa in tha affiel^ey of 
tha saapla design. 

Notiea in Figure 2. that 26 of tha 49 obaarvatlona are on the 
diagonal, and an additional 16 ara on althar side* Thus 86 per- 
cent of tha obaarvatlona ara within tha specif lad lialts* lha 
r.oaparlsona in Figure 2 suggaat that atratlflad land use in tha 
lower rangaa of parcantaga cultivated aora often watched tha ob- 
served values than did tha land use classification in tha areas 
of aora intensive cultivation* This would saaa to indicate that 
stratlflars bacawa wore eonsarvatlva in thalr intarpratation of 
LAHDSAT signatures as tha intensity of cultivation increased. 


^ Additional oonsMnta can be found in (11) 
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figure 2; COMPARISON OF OBSERVED TO FRAME PERCENTAGE 


CULTIVATED FOR BRAZIL 



FIGURE 3: COMPARISON OF OBSERVED TO FRAME PERCENTAGE 
OF CORN/SOYBEANS FOR BRAZIL 


FRECXJENCY BLOCK CHART 
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FRAME PERCENTAGE OF CORN/SOYBEANS 




FIGURE 4: COMPARISON OF OBSERVED TO FRAME FIELD SIZES FOR BRAZIL 
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Figure 3 shows the coapsrlson of percentsge of corn and soybeans 
to total cultivated land. Forty of the 49 observations are In 
the blocks along the diagonal. The data in this table reflect 
the wide range of the percentage cultivated where almost 80 
percent of the observations liad values greater than 40 percent. 
Analysts could not determine the percentage of corn and soybeans 
from the LAl^DSAT signatures alone. Therefore, ancillary data 
was used as the major Indicator of the percentage of com and 
soybeans in each stratum block. These comparisons Indicate that 
the use of ancillary data was generally accui :te for coding the 
percentages of corn and soybeans. 

Figure 4 compares field size. Fields larger than 300 acres were 
not observed In the selected sites. However, some fields which 
may have been in that range were seen while traveling from one 
location to another. Moat of the fields observed ware small 
(less than 100 acres). The data Indicate that stratlflers had 
trouble In determining field size from the LAMDSAT scenes. 

The comparisons of observed to stratified values were generally 
very go^. A few areas where the stratification was quite a bit 
off were noticed. Most of the gross misstratlflcatlon was una- 
voidable because the signatures of some growing crops, such as 
sugarcane, could not be distinguished from natural vegetation. 

Also, the random planting of fields in vast plains areas caused 
and will continue to cause problems In determining the 
percentage of land cultivated In these areas. 

The Interpretation of signatures which evolved from the stra~ 
tlflcatlon process and which is shared by the analysts and 
reviewers la consistent with actual conditions* For example, 
signatures which were coded as bare soil did represent bare 
soil: areas of cultivation were cultivated, and so forth. 
However, signatures of some growing crops will contlcue to be 
confused with those of native grasses. 

The ancillary data for Brazil provided good Indications of 
relative levels of percentages cultivated, but should not be 
relied on solely for absolute levels. The percentage of com 
and soybeans to total crops, computed from the ancillary data, 
appeared to be a good Indicator of the acpual percentages* 

Hence, the ancillary data was reliable as an Indicator of 
relative values* 


Defining the sampling frame required chat a program be developed 
which would ccmipute the latitude and longitude of the center 
point of each sample unit in the target area* The geometric 
arguBwnts used In this program are contained In Appendix 6. 
Basically the algorithm Is an iterative procedure which takes 
Into account the curvature of the earth and the slle of the 
desired sample unit when positioning the rectangular sampling 
units over Che target area* 
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A computer «»8 us^ to define the sampling frame. This use of 
software to create the fr'use allows freedom in the choice of 
sample unit (cluster) size. In order to stratify the popula>- 
tion, a method was devised to assign auxiliary information tc 
the sample units. This objective was accomplished through 
prograsaning . 

Software was written to assign the auxiliary Information to each 
sampling unit In the frame* The location of the sampling unit 
was compared to the latitude/longitude polygon files which con” 
taiu land use and political subdivision information* The 
Information assigned to a sample unit was determined by the 
location of the center point of the sample unit* The program 
used to assign the auxiliary data Is in Appendix 5* 

Originally the auxiliary Information for a sample unit was to be 
created by allocating polygon Information .o a sample unit In 
proportion to the aaK>unt of the polygon In the sample unit* 
However, there were many inherent probloas with materials 
which would cause difficulties in registration. For these 
reasons the sample unit was assigned the auxiliary information 
ifhich corresponds to the polygon in %diich the center point of 
the sample unit falls. 


The Brazil sampling frame Is a data file which contains the 
center points of the cluster sampling units in the three state 
area of interest* Each of these sampling units also contains 
political subdivision and land use Information* 

A number of sampling designs are possible with this type of 
file* These designs Include: 

1. Simple random sampling* 

2* Stratified sampling by political subdivision and/or crop 
characteristics* 

3. Sampling with probability proportional to size (PPS) based 
on a cropland or crop-specific estimate. 

Additionally, the use of alternative estimators is a possibil- 
ity. These estimators could include both ratio and regression 
types. Stratification of the sampling frame based on the 
auxiliary information is also being considered by a number of 
organizations, including the SFDS. 
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I. Project Information ; 

!• Country: Brazil 

!• Area(a): States of Parana » Santa Catarina and Rio Grande 

do Sul 

3. Requlrementa: Stratify the following variables: 

(1) a« States 

b. Political subdivisions (munlclplos)-Co be entered 
Internally by automated grid software* 

(2) Agricultural land (evidence of cropping pattern) 

(3) Percentage cultivated 
(A) Soils type 

(5) Density of com and soybeans 

(6) Field size 

4. Material: Geographic data. Including cultural and climate 

data 

(1) Agricultural statistical data 

(2) Crop calendars 

(3) Soils maps 

(4) Satellite Imagery 

(3) Maps and map transparencies 

Primary work unit: A one degree latitude-longitude quadrangle 

outlined on prepared overlays* The overlays have an overlap on 
all sides which can be used for notea, observations, and so 
forth* 

Restrictions: The area frames for AgRISTARS are to be 

constructed compatible with an automated grid sampling schne* 

Since no ground truth Is available for the AgRISTARS frames, we 
will not be able to actually determine the accuracy of our stra- 
tification* Therefore, It Is essential that we be consistent In 
all of the work. Even If we mlsclasslfy a particular variable, 
we can still have a frame that Is efficient for sampling* The 
true Identity of the characteristic will have to be determined 
through sample data collection or interpretation. Review of 
Individual work to assure consistency throughout the frame Is 
absolutely necessary* 
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n. Stratification Guld«lln«»/ Information Unlqut to AgRISTARS 

1. Color-infrared paper scenes and map transparency overlays 
witVi a scale 1:250,000 will he used* The map 
transparencies will provide stratification variable 1-a 
(states) and also show coin bodies of water and roads for • 
general information* 

. 2* Land use rather than physical boundaries w'ill be 
emphasized * 

3* Each 1° work unit will have a worksheet (Stratification 
Procedure and Analysis Record) which contains spaca for 
identifying the work unit and the LANDSAT scenes used in 
the analysis and the persons responsible for each step in 
the procasa* Space is also provided for comants to aid 
in the analysis at each of the steps* A work number 
identifying the block should be entered on the overlay and 
the work sheet* When written cosMants refer to a specific 
block, the blocks should be numbered and tha number 
circled on the overlay* These reference numbers will not 
be entered into any system in the first land use 
stratification, but will be used to point out questionable 
areas to the analyst or reviewer* 

III* Stratification Work Procedures 

1* First Carto Procedure 

(1) Prepare the "Stratification Procedure and Analysis 
Record” work form for the one degree work unit* 

Enter all required information completely and legibly* 

(2) Place Che one degree base map transparency over the 
"best" (conalderlng coloring, cloud cover and date 
flown) LAMDSAT scene available for Che area to be 
stratified and tape to the scene* 

(3) Place the 1:250,000 soils map transparency over the 
baae map overlay; align and Cape to tha baae map 
overlay so that it does not slip and yat can be 
picked up to observe Che LANDSAT scene more closely 
if required* Review with assistance other 
topographic maps if available* In some Instances Che 
magnification process may obscure a portion of the 
LANDSAT scene* If this is too drastic, new overlays 
should be drafted* 

(4) 'Place clear overlay map transparency and tape to the 
base nap and soils map transparencies* 

(5) Review Operational Navigation Chart (ONC) and 
geographic research data (if available) and compare 
with Imagery features to achieve good orientation and 
write the geographic characCeristlcs of work unit on 
the work form* The land use nap obtained from Dr* 
Westln nay be used for supplementary InforMtlon* 
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( 6 ) As the first step in stratification, delineate 

definite crop land areas (field patterns vislMe) 
with a blue pencil oi\ the clear overlay. 

Concenlr»tte on hre.ui paileri'i areas color 
differences when de 1 Inoat Inp, and also pick out small 
areas wlifch are in striKinp, contrast tc the broad 
area. For example, a small densely cultivated valley 
inside a heavily wooded rollon. Code these cropland 
areas by percentage cultivated: less than 5, to 

JO, 20 to AO, AO to hO, (»0 to 80, 30 to 95, and 95 
plus (see coding sheet). All areas having less than 
5 percent cultivated land are coded A. Bodies of 
water will also be drawn out and coded as "O" (not 
sampled). Nininun block size for land is 9 square 
miles (3/4 inch^) and water Is 4 square miles (1/2 
Inch^). The latitude and longitude lines forming 
the boundary for the one degree block are the work 
unit boundaries and cultivated strata blocks should 
not cross these lines. 

(7) A unique strstif Icstlon problem Involves classifying 
and coding land within the one degree work unit which 
la in a country other than Brazil (i*e., Argentina). 
Since the entire one degree work unit mu»t be 
stratified and coded, the following guidelines 
apply: 

A. ’'Foreign" laud within 4 miles (approximately 1 
inch) or the border must be stratified and coded 
as though it were pert of Bracil. 

B. "Foreign*' land beyond the 4 mile limit can bo 
lumped together and coded "0" (not sampled). 

2. First Analyst Procedure 

(1) Working with a purple grease pencil, review the basic 
cultivation atratificacion done by the first carto. 
Make working notes in the Carto note section of the 
tforksheet and dlacuas any changes or modifications 
with the carto responsible for the cultivation 
stratification. 

(2) Use ONC's or other topographic naps which may show 

land use such as swamps, brush, and trees, etc., to 
further subdivide the areas coded "A” into three 
general groupings: (1) no potential for agriculture 

(arid, steep, or mountainous terrain), (2) littla if 
any potential for agriculture (woods, bruah and salt 
aarah), and (3) some potential for agriculture 
(grassland and othar). 

(3) The blocks of land stratitled by percentage of 
cultivated land in Carto step ons are to be coded for 
the percent of the cultivated land that is potential 
corn and soybeans. Ravlew the crop calendar and any 
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crop acr«ag« data or aupplaMncal crop aapa avallabla 
for tha rtglon balng atratlflad* Tha crop calendar 
will ahow uaual planting and harvaatlng datea for any 
competing cropa (cropa that may be planted on the 
aame land at the sane time)* Arrange all availablo 
imagery by month s'-remember Chat cropa are expected 
to change in specific fields between years as well as 
within years, if double or continuous cropping is 
normal practice* In Brazil Che first month of the 
growing season will be August* There will be a 
varying number of scenes with potential dates across 
several yesrs* Thua, on any one scene, the crop 
calaudar will need to be interpreted for that 
specific date (Interpretation consists of defining 
areas with potential corn and/or soybeans)* 
Concurrently use all statistical data that m»y be 
available In detamlnlng occurrence and density of 
corn and soybeans* Adjoining scenes with varying 
amounts of overlap may provide additional 
multltemporal information and should be used whenever 
possible* Soil type groupings may provide additional 
Infonpai^lon on delineating or extending areas of 
specific crops* Using the crop calendar, look for 
crops that are planted earliest in the year ahead of 
ocher crops* These should show up first on a 
sequence of images for the same footprint and you may 
be able to determine that crop's signature for the 
season* The same approach may be taken for harvest 
times when early crops are maturing and are being 
harvested* Once particular fields are identified as 
specific crops this information nay be used to 
Identify similar and nonslmllar crops* 

(4) Classify Che blocks by four groups based on 
percentage of com and soybeans* The breakdowns are 
40 percent or more of the area devoted to com and 
soybeans, IS to 40 percent, S-'IS and less than 5 
percent (See appendix 3, Coding Sheet)* If the 
density or potential density of com and soybeans 
varies substantially within the block, additional 
blocks should be fortMd* The density of com ud 
soybeans is Independent of the percentage 
agriculture, namely, of the cultivated land In Che 
block 40 percent la potential corn and soybeans, and 
so forth* 

(5) A short narrative should be developed by Che analyst 

describing the cropping practice and calling 
attention to specific areas that will aaalst Che 
cartographic personnel In subsequent stepe* In 
addition CO Che narrative, the analyst nay also write 
Information about Interpretation on the overlay 
materials. These connents may entail such Itens as 
saying the light pink colored areea are grassland, 
while the red field on Scene # and the brown 
fields on Scene # are thought to be com and 
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soybeans and tha whits 'islda on Scans # ara small 
grains* For notes referring to specific blocks of 
land, a temporary work number identifying the block 
should be entered on the over.'.ay and used to 
reference the appropriate note or comment (see 
section II-3)* 

3* Second Carto Procedure 

(1) Place a new, clear overlay over existing 
transparencies* Align and label this second 
overlay. 

(2) Using green lumocolor, draw In stratum blocks using 
straight lines %rhere possible, to facilitate 
digitising* Code each block with the appropriate 
codes for percentage land cultivafed and percentage 
of corn and soybeans* 

(3) The next step is to classify the blocks as to 
predominant field sizes* The three major breakdowns 
being fields less chan 100 acres, fields 100*300 
acres and fields greater chan 300 acres* A temolafe 
should be used for questionable or marginal blocka* 
Use appropriate codes from the AgRISTAHS coding 
sheet* 

(4) Code Che blocks of land according to majur sol. a 
types and enter ode on the overlay* We will code 
only Che major soil types found in Che block (see the 
coding sheet). Subdivide Che block to improve Che 
accuracy of the soil clasaiflcation if possible to do 
so wl'ihouC violating the minimum block else (9 sq. 
mllei, 3/4 Inch^)* Each tine a block ia 
subdivided, all coue data should be entered into ench 
new block* 

4* Second Analyst Review 

(1) Review the final carto stratification and make any 
necessary corrections in purplb lumocolor* Base the 
review on all available materials, notes, and 
ancillary data* 

(2) Review for consistency in stiatification and make any 
necessary corvections to ensure that consistency* 

5* Final FrasM Preparation 

(1) Place the final frame material over Che base nap and 
the reviewed and checked second carto overlay (remove 
all other overlays) and draft the final frame. Bring 
forward the appropriate codes for percentage 
cultivated, percentage corn/soybeans, field size and 
soil type* Assign a unique threo digit code to each 
strata block by numbering the blocks starting with 
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001 in th« northeast corner of the one degree work 
unit in a serpentine fashion until all blocks are 
nunberad* The code data to be entered In each strata 
block Is as follows: 

y.l\X - unique strata block number 
X - percent cultivated 

X - crop specific code 

X - field size 

XX - soil type 

X - check digit 

(2) Compute the check digit and enter. The check digit 
Is the unit portion of the sum of the individual 
digits* Fnr example if the block had been coded 
2A7 21212 the check digit would be computed as 

(2+4+7+2+1+2+1+2) - (21) - 1 
Thus the complete code would be 24721212^ 

6* Supervisor Review 

Review for completeness and neatness. 

7. Statistician Review 


Foti? 1 acceptance of work unit 
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APPENDIX 2 

A^RISTARS 

STRATtFICATtON PROCEDURE AND ANALYSIS RECORD 


Country 

Lit Pollticnl Subdlvl)ilon(s) 
2nd Political Subdlvision(a) 


Crop Varlablaa (1) (2) (3) 

Work Unit Nuabar Coordlnatna 


Footprint 
Data : 


Footprint 

Data 


Footprint 
Data : 


Footprint 

Data; 


Scana(a) uaad aa baaa tor stratification: 
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FIRST STRATA OVERLAY 
C«rto N«t«* 

Data Scartad D«t« Ftnlahad 

Carto Notaa: (Orlantaeion, Allgnaant. Scana Quality) 


Gaographlc Characearlatlcs of Work Unit: 


Cliaata of Work Unit: 


Crop Calandar: Aetachad 

Crop Data Shaat: Aetachad 

Ralatad Napa, Othar Natarlala 


Carto Notaa: (Ralatad to land uaa analyala and agrlcultura) 


Ih 


Analyse Name 
DACe .^carted 


ONNalNAl. L 
or QUAl »TY 


Data Complaead 


Analyst Notas: 


ORIQINAL Pl^ !S 
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SECOND STRATA OVERLAY 
Carco Name 

Date Started Date Finished 

Carto Notes: 


STATISTICIAN REVIEW 

Name 

Date Started Date Finished 

I'iotes : 


FINAL FRAME 
^arto Name 

Date Started Date Finished 


SUPERVISOR REVIEW 

Name 

Date Started Date Finished 
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1. Count Unit ID - enter sequential 001 to nnn digit(s) 
starting in northeast corner in east to west, west to east 
serpentine manner* 

2. Percent Cultivated (Final Stratification) 


Carto 

Code 


Analyst 

Code 

0 

1 


Description 

not sampled (water, other countries)* 

Less than 5X cultivated, no potential 
for agriculture, usually arid, steep 
mountainous terrain. 

A 2 Less than 52 cultivated, little If 

any potential for agriculture, 
usually woods, brush and marsh* 

3 Less than 52 cultivated, has 

potential for agriculture, usually 
grassland* 

4 4 5 to 202 cultivated* 

5 5 20 to 402 cultivated. 

6 6 40 to 602 cultivated. 

7 7 60 to 802 cultivated* 

8 8 80 to 952 cultivated* 

9 9 952 or more cultivated* 

Percent Corn and/or Soybeans 

Code Description 

0 not sampled (water, other countries). 

1 Less than 52 corn/soybeans* 

2 5 to 152 corn/soybeans* 

3 15 to 402 corn/soybeans. 

4 402 or more corn/soybeans* 


4* Field Size 
Code 
0 


Description 

not sampled (water, other countries). 
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1 . Fields less then 100 seres* 

2 Fields 100 to 300 acres* 

3 Fields greater than 300 acres* 

5* Soils Type 

Description 

not sampled (water, other countries)* 

AH - Hydromorphie Alluvial Soils* 

A|{A - Hydromorphie Alluvial Solis 
with well drained acid Alluvial 
Soils* 

Af}RB T/A -(No narrative description 
provided ) 

CR R/L - Chernosem on loessreovered 
undulating to rolling plains* 

CHi,T/A - Loamy Chernosem soils on 
alluvial plains * 

G-HB - Gray Hydromorphie soils with 
half bog on nearly level plains* 

LFsR/Sd - Sandy Low - Ru^c 
Ferruginous Latosols from Sandstone 
on rolling terrain* 

08 LFg R/SL -(No narrative description 

provided ) 

09 LFg R/T - Sandy Lov-Rumlc 

Ferroglnous Latosols from 
consolidated rocks on rolling 

’ terrain 

10 LR R/C - Red Latosols from 

undifferentiated consolidated 
rocks on hilly terrain* 

U LR R/S - (LR R/TS) - Red Utosols 

from sedimentary rocks on hilly 
terrain* 

12 LR M/ - Llthosollc Red Latosols on 

mountains of humid tropics* 

13 LR R/C - Red Latosols from 

undifferentiated consolidated rocks 
on rolling uplands* 


Code 

00 

01 

C2 

03 

04 

05 

06 
07 
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14 LR R/U - Red Latosols from uncon* 

solldated materials on rolling 
terrain* 

15 LTX R/B - Terra Roxa (Latosols) from 

basic rocks on rolling terrain* 

16 M * Dominantly swampy land but 

locally Includes Islands of better- 
drained land* 

17 ~ Planosollc Chernozem 

soils from lacustrine and materials 
with Solonchak* 

18 PR R/S - Prairie from sedimentary 

rocks on undulating to rolling 
plains* 

19 PR R/U - Prairie from unconsolidated 

materials on rolling plains* 

20 RBu ~ Reddish brown from uncon- 
solidated materials In level to 
rolling plains* 

21 Reddish brown from unconsolidated 

materials on level to rolling plains, 
with Solonchak* 

22 RGpU ~ Regur soils fron fine* 

textured unconsolidated materials on 
level to gently sloping plains* 

23 RL H/B * Reddish broira laterltlc from 

basic igneous rocks on hilly 
terrain* 

24 RL H/S - Reddish brown laterltlc from 

sedimentary rocks on hilly terrain* 

25 RL R/B * Reddish brown laterltlc from 

basic Igneous rocks on rolling 
plains 

26 RYP H/C * Red and yellow Podzolic 

soils from consolidated rocks 

on hilly terrain* 

27 RYP R/SdH - Red and yellow Podzolic 

from sandstone on undulating to 
rolling terrain, marshes and ponds* 

28 RYP T/A M - Red and yellow Podzolic 

on terraces with marshes and ponds* 
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29 

RZ H/B - Rubrescma and aaaoelatad 
tolla from basic rock on hilly 
tarraln* 

30 

RZ H/g - (No narrative daecrlptlon 
provided ) 

31 

RZ R/B - Rubrosems from basic rocks 
on rolling terrain# 

32 

RZ R/S - Rubrosems from eadlmantary 
rocks or rolling topography* 

33 

RZ RH/C - Rubrosems from consoli- 
dated rocks on rolling and hilly 
terrain* 

34 

SB - Sandy Beaches 

35 

SM-A)is -Salt water marsh with 
Hydromorphlc Saline Alluvial soils* 

36 

w - Ulesenboden (flat to nearly level 
plains) * 

37 

PPR Y/S - (No narrative description 
provided ) 

38 

PR Y/S - (No narrative description 
provided ) 

39 

RG R/B - (No narrative description 
provided ) 

40 

AH-AA - (No narrative description 
provided ) 

41 

RYP R/TS - (No narrative description 
provided ) 

42 

RYP R/SD'4< -(No narrative descrip- 
tion provided) 
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Municlplo (a county sized political subdivision) boundaries were 
digitized to provide a file which could be used to assign 
munlclpio codes to the sampling unit file and to be used in -the 
computation of crop densities for these regions. 

Estimates of area planted to corn, soybeans, rice, kidney beans, 
cotton, and sugar. There was also a category for other 
unspecified crops such as wheat, coffee, and so forth. However, 
total land was not available at this level. 

Total land area was estimated by digitizing maps supplied by 
NASA. These maps showed munlclpio boundaries and latitude and 
longitude lines* The original 1:1,000,000 maps were enlarged 
and reproduced on clear acetate at 1:250,000* In order to form 
our 1° square work units, mosiacs were made and the work units 
traced on paper copy. The tracing of work units was necessary 
because many 1** squares were on parts of two or more maps* 
Further, the dots used to indicate places to digitize were not 
clearly visible when placed on the acetate because of the thick 
black boundary lines which were caused by enlarging the small 
scale maps* 

Estimating total area by munlclpio posed several problems which 
had to be resolved* The siaps showing the munlclpio boundaries 
had to be drafted by NASA and were received late in the 
construction phase* These naps contained a few munlclpios that 
had been incorrectly coded, some boundaries had to be estimated 
where maps overlapped, and eight munlclpios were omitted 
entirely* 

Errors In digitization also had to be resolved* The digitizer 
boards and microprocessor were recently purchased and software 
had to be developed for the system* Time was needed for 
training in the use of the equipment* The disk storage capacity 
of the minicomputer limited the amount of data which could be 
processed at any one time* 

Munlclpio areas were digitized by work ubits* This procedure 
created special problems because many munlclpios had parts In 
more than one work unit* These parts had to be aggregated to 
obtain the total area for the munlclpio* Thus, a method was 
devised for determining when all parts of a municlplo had been 
digitized* Only then could the areas be summed to obtain the 
total required* 

The first step in aggregating municlplo areas was to obtain 
computer listings of the dlgltizec areas (In square Inches) by 
work unit. Each work unit listing was manually matched w.th a 
map which showed municlplo boundaries. If a municlplo fell In 
more than one work unit, each part was numbered and coded so 
that the part could be associated with the particular work unit 
which contained that portion. The total number of parts and 
the specific part numbers for each municlplo was recorded. 
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Thaso values along wlcti an Idenclficaclon (ID) variable and 
the digitized area were used as Input for the update program. 
This update program created variables which Identified each 
work unit containing a portion of a particular munlclpio and 
calculated the munlclpio area when all parts of a munlclpio had 
been processed. The master file was then updated. After each 
run, printouts showing the crop area estimates and the crop 
percentages of the total area were obtained. The printouts were 
checked against the map to verify that munlciplos were cor- 
rectly shown as completed. Whenever discrepancies occurred, 
such as improper coding of the parts of the munlclpio falling 
In different work units, omission of munlclpio boundaries, 
duplication of munlclpio mimbers, or errors In the Identifica- 
tion variable, the problem was resolved. 

All of the preceding data manipulations were required to obtain 
ancillary data In a fora that was useful for developing the 
area frame. Since strata were defined using the percentage of 
agriculture and percentage of com and soybeans, the ancillary 
data also needed to be In percentages. The analyst* used the 
percentages to check signature Interpretations for consistency 
within and among strata. The variable, percentage com and 
soybeans, was computed by summing areas planted to those two 
crops and dividing by the sums of areas planted to all crops. 
After arriving at the estimated total area we computed the 
value for percentage of cultivation by dividing the sum of the 
crop areas by the total land area. 
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SOFTWAKi: DOCUMENTA- 
TION 

PROGRAM DESCRIPTIONS 

Polygon Control 
Nuaber Vtriflcgtlon 


Area Check 


Calibration Check 


Point File Main Edit 


Point Averaging 


Point File Cen- 
veraion 


OF 


QUALITY 


Tlie programs n«*cos8ary for tho development of the sampling frame 
were written l>v Bob Hale and Ron Fecso of the SFDS, SRS, USDA In 
early 1980< ITiese programs edit and manipulate the raw data in 
the digitized files and then convert the edited data into the 
format required for automated sampling of the frame* 


The polygon control number identifies each land uae block and is 
made up of a unique block number, the vector of information, and 
a check digit. The check is defined as the units value of the 
sum of the digits making up the block number and the vector of 
information. The digits of the polygon control number are 
summed and the units digit is compared to the check digit. If 
the values do not match a warning message is Issued. The coding 
for each flagged polygon must be rechecked and corrected. 

The digitized area of the polygons is aggregated and compared to 
the total area of the work unit. If the areas differ by more 
than .2 percent an error message Is issued. The points in the 
file must be verified and any corrections made by the person who 
digitized the work unit. 

The four corner points in the calibration file are compared to 
the four corner points in the point file. If either the X or Y 
coordinates of the matching corner points differ by SMre than 
.025 inch, the calibration file must be redone. 

Matching points are checked and the number of matching points is 
verified. Hatching points are defined to be points coincident 
to contiguous polygons. For example, if matching points make up 
the coincident boundary of two contiguous polygons each of the 
matching points would have to be recorded twice in the 
digitizing process in order to define the boundaries of both 
polygons. If the X and Y coordinates of matching points differ 
by more than .025 inch they are considered to be nonmatches. 

All points not having the required number of statches are output 
in an error file. Corrections are made by the person who 
digitized the work unit. 

The X and Y coordinates of matcliing points are averaged and 
aasigned the average value of all matching points in the point 
file, 

The X ard Y coordinates of the points in the point file are 
conver,:ed to latitude and longitude. The conversion uses a 
line if relationship determined by the four points in the 
calibration file. 


Copies of chese programs are available upon request. 
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Work Unit Grldding Th« concer points of the work units are used to form a (trid of 

sample units covering the area of interest. The auxiliary 
information assigned to each sample unit, is determined by tho 
location of the center points. F.ach sample unit Is assigned 
the information corresponding to the polygon in which the 
center point falls. 
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Ctoaattic Arguwnta Lines of longitude correspond to the Intersection of the surface 

of the earth with planes which contain th'; polar axis* Lines of 
latitude are the Intersection of planes with the surface of the 
earth such that the polar axis is perpendicular to the plane. 
Treating the earth as an ellipaoid of rotation (O (Clark 1866 
Ellipsoid also known as 1927 North Aaerlcan Datum), the 
following parameters are recuired: 


1. Equatorial radius 

2. Polar radius 

3. First eccentricity 

4. Second eccentricity 


a “ 6,378,206 meterH 
b “ 6,356,584 meters 
e, ■ (a2-b2)»5/a 
e^ - (a2-b2)‘5/b 


Then, the distance on the surface of the earth from the equator 
to a specified latitude, t. Is (In series expansion) 

S(t) ■ Sot + axsin(2t) + a2Sln(4t) + . . . 

where 


So • a(l-e2) (l+n/A)e2 -h (45/64)e^+ (175/256)e6 
(11025/16384)e8 + . . .) 

ai- “(l/2)a(l-e2)((3/4)e2 + (15/16)e^ + 

(525/512)e^ + <2205/2048)e8 + . . .) 

a2 - (l/4)a(l-e2 )((i5/64)e^ + (105/256)e6 + 
(2205/4090)e8 + . . .) 

For the creation of AgRISTARS sampling units, the inverse of 
S(t) is neces&ary. Thus, using the following formula we can 
compute the latitude (in radians) of a specified distance, K, 
from point t^ along a line of longitude. 

'.(H.tj) - ti + (S-Sq) Oj + (S-S<,)2 q2 + . . . 

with H - (5 -Sq) - (S-S(ti)), 

qi - l/(ao + 2ai cos(2ti) + 4a2 coe(4ti)). 


and 


q 2 ■ (1/2) qj (4ai sin(2t^) + 16a2 sln(4tj). 

The distance along a line of latitude, t*T, between two lines of 
longitude is determined by 

D((T.g 2 ), (T,gi)) • (r cos T) (g 2 - gi) 

where g 2 and g^ are the lines of longitude and 
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r"(<(co*^o)/*2) + ((sln2o)/b2)) *2 


Creating the 
Saapllng Frame 


Thus, the angle of longitude (in radians) subtended by an arc of 
length, L, along the line of latitude t^ is 

G(L,ti) * L 

(r cos(ti)) 

Each of the distance functions has an inverse mapping in vhic,., 
given a point, P^, and a desired distance along a given 
latitude or longitude, the latitude and longitude of the point 
can be computed. To grid a spherlod using these distance 
measures and inverses, it is necessary to define starting points 
(^) . The equator and the 0* line of longitude was used for 
convenience in computer applications* 

In this section the approa'.h for creating a "grid" system of 
sampling units is outlined. The grid system ot sample units can 
be established by first defining planes of latitude which slice 
the surface of the sphere into strips which have latitude 
defined by t^, such that, 

S(ti+i) - S(ti) - H 1 - 0. 1, 2 1 

where II • height of desired sample unit, 

tg • 0 (The equator), 

t£+2. ^3 greater than t^, and 

t^ is less than (2/9) radians (approximately 80<’). 


Note that the slicing algorithm for general application is done 
from the equator toward the North Pole. The appropriate 
reflections are used to define the slices in the Southern 
Hemisphere. 

After the strips which will contain the sample units are 
defined, all that is required is to slice the strip into the 
appropriate length (L) sample units. To do this the algorithm 
starts the slicing by placing the westernmost edge of the first 
sample unit approximately along the O*’ line of longitude. The 
center line of latitude is c<mputed for the strip and Increments 
of length equal to the sample unit length are made along the 
center line. Thusi the center points of this latitude/longitude 
slicing process define the sample units. 

To understand the grlddlng algorithm, note that there are (I ■h D 
unique values for latitude, t£, which slice the quadrant. For 
each of the J *■ 1, 2, . • 1 "center latitudes" of the slices 

there are Hj values of longitude in the quadrant which "chop" 
the "slice" into sampling units. Notice that Nj is 
nonincreasing as the center latitudes, Increase. 
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The Yj are found incrementally by first defining tg “ 0 and 
then incrementing along g » 0 to find t]i, t2» • • • > tj. 

The latitude of the center points of the sample units, Yj, is 
found by: 

Yj « (tj + tj_i)/2, j - 1, 2 I 

where 


tj - T(H, tj-i). 

The Nj sample units along latitude Yj are computed 
iteratively as follows* First let 

go" 0* 

Then the line of latitude is chopped into lengths the size of 
the sample units by creating 

8k “ 8k-l + G(L,Yj), k - 1, 2, . . *, Nj. 

Thus the center point longitude of the sample, unit, X]^, are 
computed as follows: 

^k ■ (8k 8k-l)/2, k • 1, 2, * . . Nj. 

As a result, the iteratively created collection of ordered pairs 

(Xk. Yj) 

define the center point expressed in terms of latitude and 
longitude for each sample unit in the quadrant. It should be 
noted that the actual program creates a file of sample units 
which consists erf the intersection of these center points and 
the area for which the frame is being created. That is, only 
the sample units in the target area are created. 


